Random Regression for Bayes Nets Applied to Relational Data
نویسندگان
چکیده
Bayes nets (BNs) for relational databases are a major research topic in machine learning and artificial intelligence. When the database exhibits cyclic probabilistic dependencies, the usual Bayes net product formula does not define valid inferences. In this paper we describe and evaluate a new approach to defining Bayes net relational inference in the presence of cyclic dependencies. The key idea is to define the random regression logprobability of a target node value (unnormalized) as the expected log-probability (unnormalized) associated with a random instantiation of the node’s Markov blanket. We provide a tractable closed form for random regression, which is equivalent to a loglinear model, but with the predictors scaled to be instance frequencies of relational patterns (features), rather than instance counts. Instance counts are used in inference models based on Markov networks. We carried out an empirical comparison on five benchmark databases with (i) weights as log-conditional probabilities using maximum likelihood estimates vs. (ii) general weights learned with Markov net methods. Maximum likelihood estimates took seconds to compute in comparison to hours for Markov net learning. With the frequency scaling, predictive accuracy for the conditional probability weights was competitive with the general weights.
منابع مشابه
Join Bayes Nets: A new type of Bayes net for relational data
Many databases store data in relational format, with different types of entities and information about links between the entities. The field of statistical-relational learning has developed a number of new statistical models for such data. Instead of introducing a new model class, we propose using a standard model class—Bayes nets—in a new way: Join Bayes nets contain nodes that correspond to t...
متن کاملA Tractable Pseudo-Likelihood Function for Bayes Nets Applied to Relational Data
Bayes nets (BNs) for relational databases are a major research topic in machine learning and artificial intelligence. When the database exhibits cyclic probabilistic dependencies, measuring the fit of a BN model to relational data with a likelihood function is a challenge [5, 36, 28, 9]. A common approach to difficulties in defining a likelihood function is to employ a pseudo-likelihood; a prom...
متن کاملJoin Bayes Nets: A New Type of Bayes net for Relational Data
Many real-world data are maintained in relational format, with different tables storing information about entities and their links or relationships. The structure (schema) of the database is essentially that of a logical language, with variables ranging over individual entities and predicates for relationships and attributes. Our work combines the graphical structure of Bayes nets with the logi...
متن کاملBayes Nets for combining logical and probabilistic structure
We outline a new approach to using Bayes nets for a probabilistic extension of a logical structure or schema. Many real-world data are maintained in relational format, with different tables storing information about entities and their links or relationships. The structure (schema) of the database is essentially that of a logical language, with variables ranging over individual entities and pred...
متن کاملModelling Relational Statistics With Bayes Nets (Poster Presentation SRL Workshop)
Class-level dependencies model general relational statistics over attributes of linked objects and links. Class-level relationships are important in themselves, and they support applications like policy making, strategic planning, and query optimization. An example of a class-level query is “what is the percentage of friendship pairs where both friends are women?”. To represent class-level stat...
متن کامل